Prediction of Accent Commands for the Fujisaki Intonation Model
نویسندگان
چکیده
This paper presents a model to predict the accent commands (henceforth ACs) of the Fujisaki Model for the F0 contour, being known the phrase commands (henceforth FCs). Accent commands are associated with syllables. For each syllable, an artificial neural network (ANN) decides, with an accuracy of 89.4% whether there will be an associated AC or not. For syllables with associated AC, the amplitude, Aa, the onset time anticipation, T1a, and the offset time anticipation, T2a, are predicted by additional ANNs, with resulting linear correlation coefficient of 0.602, 0.743 and 0.650, respectively. The features used for each ANN are presented and discussed. Finally a comparison between target and predicted F0 contour is presented.
منابع مشابه
A quantitative description of German prosody offering symbolic labels as a by-product
The prosodic quality of a text-to-speech system is important for the intellegibility and perceived naturalness of synthetic speech. In earlier works the author developed a linguistically motivated model of German intonation based on the quantitative Fujisaki model of the production process of F0. The current paper compares results yielded by automatic Fujisaki modeling with a GToBI-style anotat...
متن کاملR Eal - Time Manipulation of the F 0 - Contour in Synthetic Speech Using the F Ujisaki Model
In this paper, we propose a system that allows the user of a real-time speech synthesizer to directly manipulate the F0 contour of an utterance on-line and in real-time. The intonation is generated by the Fujisaki Model, which creates the F0 contour based on accent and phrase commands that the user needs to trigger. These input commands to the model can be generated by the user with the buttons...
متن کاملThe influence of speech rate on Fujisaki model parameters
The current paper examines influences of speech rate on Fujisaki model parameters based on read speech from the BonnTempo-Corpus containing productions by 12 native speakers of German at five different intended tempo levels (very slow, slow, normal, fast, fastest possible). The normal condition was produced at an average rate of 6.34 syllables/s or 100%, the very slow version at 67%, and the fa...
متن کاملThe influence of syllable structure on the timing of intonational events in German
The present study deals with the in uence of syllable structure on the the ne alignment of accent commands of the Fujisaki-model. The corpus used in this study consists of three-syllable words of German with word-accent on the second syllable which were uttered in citationform. It is examined which factors in uence accent command onset time T1 and accent command o set time T2. T1 can be predict...
متن کاملOn the Alignment of Prosodic Events
The current study examines the relationship between intonational gestures as given by the accent commands of the Fujisaki model and the syllabic grid on the example of spontaneous American English from the Buckeye Corpus. As an initial step the data were labelled according to American English ToBI conventions. Intensity contours were extracted from the band-filtered speech signal and modelled u...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004